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METHODS FOR DISEASE DETECTION 



Background of the Invention 

Many diseases are associated with genomic instability. That is, a disruption in 
genomic stability, such as a mutation, has been linked to the onset or progression of 
disease. Accordingly, various aspects of genomic instability have been proposed as 

5 reliable markers for disease. For example, mutations in the BRCA genes have been 
proposed as markers for breast cancer, and mutations in the p53 cell cycle regulator 
gene have been associated with numerous cancers, especially colorectal cancer. It has 
been suggested that specific mutations might be a basis for molecular screening assays 
for the early stages of certain types of cancer. See, e.g., Sidransky, et al., Science, 

10 256: 102-1^05 (1992). 

The search for genomic disease markers has been especially intense in the area 
of cancer detection. Cancer is characterized by uncontrolled cell growth which can be 
associated with one or more genetic mutations. Such mutations can cause the affected 
cells to avoid cell death. For example, a mutation in a tumor suppressor gene can 

15 cause cells to avoid apoptosis - a type of cell death thought to be under direct genetic 
control. During apoptosis, cells lose their membranes, the cytoplasm condenses, and 
nuclear chromatin is split into oligonucleotide fragments of characteristically short 
length. In fact, those characteristic DNA cleavage patterns have been proposed as an 
assay for apoptosis. 

20 Attempts have been made to identify and use nucleic acid markers that are 

indicative of cancer. However, even when such markers are found, using them to 
screen patient samples, especially heterogeneous samples, has proven unsuccessful 
either due to an inability to obtain sufficient sample material, or due to the low sensitivity 
that results from measuring only a single marker. Simply obtaining an adequate amount 

25 of human DNA from one type of heterogeneous sample, stool, has proven difficult. See 
Villa, et al., Gastroenterol., 110: 1346-1353 (1996) (reporting that only 44.7% of all stool 
specimens, and only 32.6% of stools from healthy individuals produced sufficient DNA 
for mutation analysis). Other reports in which adequate DNA has been obtained have 
reported low sensitivity in identifying a patient's disease status based upon a single 
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cancer-associated mutation. See Eguchi, et al., Cancer, 77: 1707-1710 (1996) (using a 
p53 mutation as a marker for cancer). 

Investigators have attempted to analyze mutations in DNA of tumor cells shed 
into luminal areas, such as the colon, bile ducts, blood vessels and the like. Such 
attempts have only been successful when there is a known mutation and a relatively 
high concentration of cellular material has been found. See e.g., Mulcahy, et al., Ann. 
Oncol. 10 Suppl 4:114-1 17 (1999). No attempts have been made to correlate disease 
status with DNA integrity in shed cellular material. 

Summary of the Invention 

The present invention provides that the integrity of nucleic acids, proteins, and/or 
other cellular components in a biological sample comprising shed cellular material 
indicates the disease status of the patient from whom the sample was obtained. 
According to the invention, tissue or body fluid samples, especially those described 
below, contain shed cellular debris. In healthy patients, such debris is the result of 
apoptotic degradation as part of the normal cell cycle. Apoptosis reduces the integrity 
(intactness) of nucleic acids, proteins, and other cellular components in healthy 
individuals, so that only small fragments exist in the debris that results from the 
apoptotic process (e.g., exfoliated cellular debris). To the contrary, in diseases such as 
cancer in which cell cycle mechanisms are destroyed or impaired, cellular debris 
comprises high-integrity cellular components, such as nucleic acids (i.e., nucleic acids 
that have not been degraded by apoptosis ). 

Methods of the invention comprise using the integrity of cellular components as a 
measure of patient disease status. Integrity is measured as any indication of the 
presence of intact cellular components (e.g., length, molecular weight, secondary, 
tertiary, quaternary structure, etc.) According to methods of the invention, a tissue or 
body fluid specimen containing sloughed cellular debris obtained from a patient having 
a disease contains an amount of intact cellular components (e.g., nucleic acid) that is 
greater than would be expected in such a specimen obtained from a healthy patient. 
Thus, a measure of intact nucleic acid, proteins, or other cellular components in a 
patient sample is indicative of the overall disease status of the patient. The invention is 
equally applicable to human and to veterinary uses. Accordingly, "patient" as defined 
herein means humans or other animals. 
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A healthy patient generally produces cellular debris through normal apoptotic 
degradation, resulting in relatively small fragments of cellular components in tissue and 
body fluid samples, especially luminal samples. Patients having a disease generally 
produce cells and cellular debris, a proportion of which has avoided normal cell cycle 

5 regulation, resulting in relatively large cellular components. Without being held to 
theory, the present invention takes advantage of this and other insights concerning the 
ways in which cells respond to diseases, especially diseases associated with genetic 
abnormalities (either induced or inherited). As a result, it has been discovered that the 
disease status of a patient is determined by analysis of patient cellular components 

10 produced in specimens obtained from the patient. Most preferably, such specimens are 
those most likely to contain sloughed cellular debris. Such specimens include, but are 
not limited to, stool, blood serum or plasma, sputum, pus, colostrum, and others. In 
diseases, such as cancer, in which genomic instabilities or abnormalities have interfered 
with normal cell cycle regulation, specimens such as those identified above contain 

15 relatively intact fragments of cellular components. The presence of such fragments is a 
general diagnostic screen for disease. Any cellular component(s) may be used in 
methods described herein, including, but not limited to, nucleic acids, proteins, 
carbohydrates, sugars, membranes, lipids, receptors, and the like. 

Accordingly, methods of the invention comprise screening a patient for disease 

20 by analysis of the integrity of cellular components in a tissue or body fluid specimen 
obtained from the patient. Preferred specimens include those comprising shed cells or 
cellular debris. Such preferred specimens comprise stool, sputum, urine, bile, 
pancreatic juice, and blood serum or plasma, all of which contain shed cells or cellular 
debris. Methods of the invention are useful as general disease screens, and are 

25 especially useful as screens for cancer. Cancer is an example of a disease thought to 
be associated with genomic instabilities, and specifically with the loss of control over the 
normal cell cycle. Thus, tumor cells are typically intact and routinely are shed into, for 
example, stool, sputum, urine, bile, pancreatic juice, and blood. Such shed cells and 
cellular debris contain higher integrity nucleic acids and other cellular components 

30 compared to those found in specimens obtained from a healthy patient. There are 
numerous ways in which the integrity of cellular components in a patient specimen are 
measured as a screen for disease. 
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In a preferred embodiment, disruption of the integrity of cellular components is 
measured in a stool sample. According to the invention, shed cells and cellular debris 
indicative of disease are shed by various tissues/organs, and eventually appear in the 
forming stool. Accordingly, an analysis of stool provides a diagnostic screen for disease 
5 generally as shown in Example 5 below. Other samples, such as blood, urine, sputum, 
etc. also provide a view of the overall health status of the patient. Follow-up testing is 
used to further diagnose and treat the disease. 

In a preferred embodiment, the cellular component used as a measure of 
disease is a nucleic acid? Nucleic acid integrity preferably is measured by the ability to 

10 amplify nucleic acids in a sample. However, integrity is also measured by sequencing, 
mass spectrometry, X-ray, etc. A preferred method comprises conducting in a tissue or 
body fluid sample an amplification reaction using as a template a nucleic acid locus 
suspected to be in the sample. If the amount of amplification product (amplicon) is 
greater than the amount of amplicon expected to be present in a normal sample (e.g., 

15 one not having the disease being screened), the sample is determined to be positive. In 
some cases, the presence of any amplification product is sufficient to justify a positive 
screen for disease. It is preferable that, in the case of DNA, the amplification reaction is 
a polymerase chain reaction (PCR) or, in the case of RNA, that the amplification 
reaction is reverse transcriptase PCR. Primers are designed to amplify the locus or loci 

20 chosen for analysis. For purposes of the invention a "genomic locus" is any genetic 
element, including but not limited to a coding region of a gene, a non-coding nucleic 
acid region, a regulatory element of a gene, an intron or RNA. It is not required that the 
target genomic loci be associated with any specific disease, as an increase in 
amplifiable nucleic acid is itself diagnostic. 

25 In one preferred embodiment, the presence of a single high molecular weight 

amplicon is a positive screen. Preferably, a fragment of about 1 .3 Kb or greater is 
measured as an indicator of high integrity nucleic acids in the patient sample. 

In a highly-preferred embodiment, a profile of amplification products across a 
range of nucleic acid fragments of different lengths is produced. In a preferred 

30 embodiment, a series of amplification reactions is conducted at a single genomic locus, 
each reaction being designed to amplify a fragment of unique length. If detectable 
amplicon is produced in each reaction, or in a number of reactions greater than 
expected in a sample obtained from a healthy patient, the sample is determined to be 
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positive. For example, attempts are made to amplify fragments of 200 bp, 400 bp, 800 
bp, 1 .3 Kb, i .8 Kb, and 2.4 Kb at the same genomic locus. In a sample obtained from a 
healthy individual (a "normal" sample), it would be expected that little or no amplification 
product is observed, especially when the longer portions of the locus are used as the 

5 template. To the contrary, at least some proportion of cells and cellular debris in a 
sample obtained from a diseased patient will contain intact fragments. 

In another embodiment, a profile of amplification products across a range of 
nucleic acid fragments of different lengths is produced by a series of amplification 
reactions conducted on a series of different genomic loci, each reaction being designed 

10 to amplify a fragment of unique length. If detectable amplicon is produced in each 

reaction, or in a number of reactions greater than expected in a sample obtained from a 
patient not having the disease being screened, the sample is determined to be positive. 

According to methods of the invention, normal samples do not produce 
significant amounts of detectable amplicon at any length significantly greater than the 

is typical apoptotic fragment (about 175 bp). Accordingly, whether primers are spaced to 
amplify fragments of only one length at a given genomic locus, or whether a series of 
amplifications at the locus are conducted, differences are readily observable between 
normal and diseased samples. 

As detailed below, methods of the invention are useful to detect disease in 

20 biological samples comprising shed cells or cellular debris. For example, the presence 
in a patient stool sample of amounts of nucleic acid, preferably DNA, above a 
predetermined threshold for healthy patients is indicative that the patient has a disease. 
Follow-up analysis is used to determine where the disease resides. However, the 
general disease screen is effective independent of the locus of the disease and the 

25 specimen taken for analysis. Thus, while the analysis of nucleic acids in stool is 
predictive of disease generally, it does not necessarily indicate that the disease is of 
gastrointestinal origin. However, follow-up screening based, for example, on mutational 
analysis, is adequate to identify the locus of disease. Numerous mutational analyses 
are known in the art and include, for example, U.S. Patent No. 5,670,325, incorporated 

30 by reference herein. Moreover, the intensity of, for example, bands on nucleic acid gels 
may be indicative of the locus of disease. 

Also in a preferred embodiment, methods of the invention are used to monitor the 
progress of a disease in a patient or in populations of patients. Such longitudinal 
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monitoring provides information on the degree to which integrity of cellular components 
is increasing or decreasing as disease progresses or recedes. Such longitudinal 
monitoring can be used to assess the efficacy of treatments (e.g, chemotherapy, 
antibiotics), and the response of patients to therapeutic interventions. Methods of the 

5 invention can also be used to predict disease flare-up. For example, monitoring 
fluctuations in patient nucleic acid integrity in diseases, such as inflammatory bowel 
disease, is useful to predict onset of disease episodes. According to the invention, 
episodic occurrence of symptoms is tied to an increase in high-integrity cellular 
components, such as nucleic acids. 

io Methods of the invention are also useful to establish patient databases. Such 

databases are used to identify specific patients, to establish where a particular patient 
fits in a disease continuum (based upon the amount of high integrity cellular 
components present in the patient sample as compared to the database in order to 
assist in diagnosis), to follow trends in disease, to predict disease onset, or to compile 

15 statistics on disease frequency, to monitor patient progress and treatment efficacy, and 
the like. 

Methods of the invention are also useful to predict risk for disease and to predict 
disease onset. Levels of cellular component integrity are useful as a quantitative or 
quasi-quantitative measure of disease. Thus, the level of, for example, high integrity 
20 nucleic acid obtained from a patient sample is compared to standards representing 
various stages of disease in order to assess the patient's disease state and prognosis. 

Methods of the invention are also useful to monitor viral and bacterial load. 
Target bacterial or viral nucleic acids or proteins are isolated and analyzed using the 
molecular weight profiles described herein in order to characterize the state of disease 
25 of the patient sample. Thus, methods of the invention are useful to screen patients for 
HIV, and to monitor its progress. 

Methods of the invention are also useful to screen for stroke, heart attack, 
asthma, and arthritis. Those conditions each result in the shedding of cells and cellular 
debris comprising intact or high-integrity cellular components produced by means other 
30 than apoptosis. 

In an alternative embodiment, screening of patient samples by detecting amounts 
of nucleic acid in the sample is combined with an assay for apoptotic cell activity. Such 
assays may be combined with detecting amounts of nucleic acid in a patient sample as 
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a screen for disease status. A positive screen is one that produces both: (1) an amount 
of nucleic acid that is greater than the amount expected to be present in a normal 
sample (e.g., one not having the disease being screened), and (2) an amount of 
apoptotic cell activity that is less than that expected to be present in a normal sample. 

5 In a highly preferred embodiment, methods of the invention comprise analyzing a 
plurality of genomic loci to determine an amount of amplifiable nucleic acid present at 
each locus. Analysis across multiple loci using methods of the invention may increase 
the sensitivity of the screening assay. 

As will be exemplified in detail below, methods of the invention comprise 

io screening a biological sample for an abnormality in a nucleic acid by conducting an 
amplification reaction using as a template a nucleic acid suspected or expected to be in 
the sample; determining an amount of amplification product obtained; comparing the 
amount of amplicon obtained to a standard amount of amplification product; and 
identifying a sample as having an abnormality in a nucleic acid if the amount of 

15 amplification product differs from the standard amount. In a preferred embodiment, a 
standard amount of amplification product is determined by amplification of a locus, or 
portion thereof, being screened (e.g., an intact, wild-type nucleic acid) in a known 
normal sample (one obtained from an individual known not to have the disease being 
screened). Also in preferred embodiments, a standard amount is determined by 

20 reference to the art. In certain embodiments of the invention, the standard amount is 
essentially no detectable amplicon due to the lack of high-integrity nucleic acids in the 
sample. Accordingly, any detectable amplicon in a patient sample is indicative of a 
positive screen. That is the case especially when a large (e.g., 1.8 Kb or 2.4 Kb) 
fragment is being screened. Finally, the standard amount can be a molecular weight 

25 marker on, for example, an electrophoretic gel. 

In a preferred embodiment of the invention, the sample is prepared from a 
specimen selected from the group consisting of stool, sputum, blood, urine, 
cerebrospinal fluid, seminal fluid, saliva, breast nipple aspirate, and biopsy tissue. 
However, any tissue or body fluid specimen may be used according to methods of the 

30 invention. Especially preferred are samples of luminal fluid because such samples are 
generally free of intact, healthy cells. Such samples include blood, urine, bile, 
pancreatic juice, stool, sputum, pus, and the like. 
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Also in a preferred embodiment, the nucleic acid or nucleic acids being 
interrogated is (are) DNA. In a more particular embodiment, the nucleic acid being 
analyzed is selected from a coding region of a gene, or portion thereof, a noncoding 
nucleic acid region, or portion thereof, a regulatory element of a gene or a portion 

5 thereof, and an unidentified fragment of genomic DNA. Also in a preferred embodiment, 
the nucleic acid being interrogated is RNA. As is appreciated by the skilled artisan, any 
genomic locus is amenable to screening according to the invention. The particular 
locus or loci chosen for analysis depends, in part, on the disease being screened, and 
the convenience of the investigator. It is not necessary that the locus or loci chosen for 

10 analysis be correlated with any specific disease because methods of the invention 
contemplate measuring either the total nucleic acid in a sample or amplifiable nucleic 
acid in a sample as an indicator of overall disease status or the presence and/or extent 
of apoptosis in the sample. However, disease-associated loci (those in which a 
mutation is indicative, causative, or otherwise evidence of a disease) can be used. 

15 Preferred disease-associated loci include p53, ape, MSH-2, dec, scr, c-myc, B-catnenin, 
mlh-1, pms-1, pms-2, pol-delta, and bax. 

The amount of amplification product may be determined by any suitable or 
convenient means. Preferably, the amount of amplification product is determined by gel 
electrophoresis. Labels, such as fluorescent or radioactive labels, may be used. The 

20 amounts of amplification product produced may be compared to standard amounts by 
any suitable or convenient means, including, but not limited to visual comparison, 
machine-driven optical comparison, densitometry, mass spectroscopy, hybrid capture, 
and other known means. The amplification reaction itself can be any means for 
amplifying nucleic acid, including, but not limited to PCR, RT-PCR, OLA, rolling circle, 

25 single base extension, and others known in the art. The amplification product can also 
be measured by signal amplification techniques, such as branch chain amplification 
(Chiron). Methods of the invention are useful with any platform for the identification, 
amplification, sequencing, or other manipulation of nucleic acids. For example, 
methods of the invention can be applied to ligase chain reaction, strand displacement 

30 (Becton-Dickinson), and others. 

Also in a preferred embodiment of the invention, a series of amplification 
reactions is conducted on a single genomic locus. Each amplification reaction in the 
series is designed to amplify a fragment of a different length. In a preferred 
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embodirnent, the target fragment lengths are 200 bp, 400 bp, BOO bp, 1.3 Kb, 1.8 Kb, 
and 2.4 Kb. Primers for amplification are designed according to knowledge in the art in 
order to amplify template, if present, of the desired length at the desired locus. A 
positive screen is one that produces amplicon in at least one, and preferably at least 

5 two of the series of amplification reactions. As noted above, a normal sample which 
has undergone or which is undergoing apoptosis typically contains little or no fragments 
of significant length. Thus, a series of amplification reactions targeting fragments from 
about 200 bp to about 2.4 Kb and longer reveals samples that contain nucleic acids that 
have avoided apoptosis as evidenced by the amplification of large fragments. 

io Preferred methods of the invention also comprise conducting amplification 

reactions on a series of different genomic loci. Preferably, from about 2 to about 7 loci 
are used. However, the precise number of interrogated loci is determined by the 
individual investigator based upon the disease to be detected or based upon 
convenience. According to methods of the invention, primers are designed to amplify 

15 nucleic acid (preferably DNA) at each of the chosen loci. A sample in which at least one 
locus, preferably at least two loci, and most preferably at least three loci produce 
detectable amplification product is considered a positive sample. The lengths of 
fragments to be amplified in this assay may be varied, but are preferably at least about 
180 bp each in length. It is not necessary that the same length fragments be amplified 

20 at each of the chosen loci. 

Methods of the invention also comprise conducting a series of amplification 
reactions at a series of different genomic loci. Each amplification reaction in the series 
is designed to amplify a fragment of a different length. Preferably, from about 2 to about 
7 amplification reactions on about 2 to about 7 loci are used. However, the precise 

25 number of interrogated loci is determined by the individual investigator based upon the 
disease to be detected or based upon convenience. In a preferred embodiment, the 
target fragment lengths are 200 bp, 400 bp, 800 bp, 1.3 Kb, 1.8 Kb, and 2.4 Kb. 
Primers for amplification are designed according to knowledge in the art in order to 
amplify template if present. It is preferred, but not necessary, that the same length 

30 fragments be amplified at each of the chosen loci. A positive screen is one that 
produces amplicon in at least one, and preferably at least two of the series of 
amplification reactions and in which at least one locus, preferably at least two loci, and 
most preferably at least three loci produce fletectable amplification product. As noted 
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above, a normal sample which has undergone or which is undergoing apoptosis 
typically contains little or no fragments of significant length. Thus, a series of 
amplification reactions targeting fragments from about 200 bp to about 2.4 Kb and 
longer reveals samples that contain nucleic acids that have avoided apoptosis as 

5 evidenced by the amplification of large fragments. 

Methods of the invention may also be used to assess the integrity of cellular 
components in a biological sample. For example, using DNA as the component to be 
measured, such methods comprise conducting an amplification reaction using at least 
two loci suspected to be in the sample as templates; determining which loci produce 

10 detectable amplicon; and assessing the integrity of DNA in the sample as a function of 
the number of loci producing amplicon. The integrity of DNA in the sample is high when 
amplicon is produced in one or more of the amplification reactions. This method is 
especially useful for determining whether a heterogeneous sample has sufficient nucleic 
acid for measurement. Accordingly, such methods are used to screen or to "qualify" 

is samples for further analysis (e.g., genetic, biochemical, cytological, or other analyses). 
Methods of the invention may also be used to assess fetal abnormalities by 
conducting amplification reactions on nucleic acids in maternal blood. Just as described 
above, the ability to amplify significant amounts of nucleic acid is an indicator of a 
genomic instability. A baseline for comparison of the extent of nucleic acid amplification 

20 can be amounts of nucleic acids from known normal samples. The amount of 

amplification obtained from fetal samples is placed on a continuum, and the investigator 
must analyze any given sample in terms of the amount of fetal nucleic acid produced in 
various disease states and in normal samples. 

Methods of the invention are useful as diagnostic screening methods. Often it is 

25 desirable to perform follow-up testing on a patient in order to- confirm a suspected 

disease state. Such follow-up procedures are determined based upon the disease state 
being interrogated. For example, a colonoscopy may be suggested in a case in which a 
stool sample is positively screened according to methods of the invention. Appropriate 
follow-up with CT scan, PET scan, x-ray, ultrasound, ERCP, Endoscopy, MRI, virtual 

30 colonoscopy, biopsy, or other measures may be appropriate depending upon the 
diagnosis. 

Methods of the invention are useful as screens for a wide range of disease 
states. In addition to colon cancers and adenomas, methods of the invention are useful 
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to screen for other diseases, for example, as screens for lymphomas, or stomach, lung, 
liver, pancreas, prostate, kidney, testicular, bladder, uterus, or ovarian cancers or 
adenomas. In addition to cancer, methods of the invention are useful, for example, as 
screens for diseases such as inflammatory bowel syndrome, inflammatory bowel 

5 disease, Crohn's disease, and others in which a genomic instability is thought to play a 
role. Methods of the invention are especially useful as screens for any disease that 
impairs the proper function of the gastrointestinal system; most especially diseases of 
the colon. Methods of the invention are also useful to screen for apoptosis in a cellular 
sample. The profile of amplifiable DNA in a sample is correlated with proteins that have 

10 been associated with disease. For example up regulation of the apoptosis protein, 
survivin, is correlated with increased amounts of amplifiable DNA, as is the Ras 
oncogene, as well as other oncogenes and their gene products. 

Methods of the invention are also useful as assays for apoptosis. The presence 
of high-integrity fragments or large quantities of nucleic acids in a sample indicates that 

15 the sample was derived from cells that did not proceed through apoptosis. The 
absence of such fragments or quantities indicates that cells that contributed to the 
sample did undergo apoptosis. Accordingly, an apoptotic activity assay of the invention, 
either alone or in combination with other assays for genomic instability, are useful as 
screens for disease. 

20 Finally, methods of the invention can be carried out by hybrid capture. For 

example, hybrid capture and subsequent analysis of the captured fragments can be 
used to determine the nucleic acid integrity of a sample. 

The invention also provides a profile of nucleic acid fragments indicative of 
disease. A preferred profile is obtained through methods described above. Preferred 

25 profiles comprise nucleic acids having between about 200 bp and about 2.4 Kb obtained 
in a patient sample comprising cellular debris according to methods described herein. A 
highly preferred profile contains at least one nucleic acid of at least 1 .3 Kb. 

Other objects and advantages of the invention are apparent upon consideration 
of the following drawings and detailed description thereof. 
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Description of the Drawings 

Figure 1 is a gel photograph showing results of amplification of K-ras (exon 1) 
DNA isolated from stool using forward and reverse primers spaced about 200 bp apart. 
The band intensity relates to the amount of 200 bp product or greater in the sample. 

5 Lanes 1-4 are results from patients with cancer or adenoma, lane 5 is a positive control, 
lanes 6-10 are from patients who did not have cancer or adenoma, lanes 11-12 are 
negative controls, and lanes 13-18 are standards at the approximate molecular weight 
indicated in the figure. Amplifications were graded A through C, A being the most 
intense band, C being the least. 

io Figures 2-4 are gel photographs showing results of amplification of ape {exon 15) 

DNA isolated from stool using forward and reverse primers spaced about 200 bp apart. 
The band intensity relates to the amount of 200 bp product or greater in the sample. 
Lanes 1-4 are results from patients with cancer or adenoma, lane 5 is a positive control, 
lanes 6-10 are from patients who did not have cancer or adenoma, lanes 11-12 are 

15 negative controls, and lanes 13-18 are standards at the approximate molecular weight 
indicated in the figure. Amplifications were graded A through C, A being the most 
intense band, C being the least. 

Figure 5 is a gel photograph showing results of amplification of p53 (exon 5) DNA 
isolated from stool using forward and reverse primers spaced about 200 bp apart. The 

20 band intensity relates to the amount of 200 bp product or greater in the sample. Lanes 
1-4 are results from patients with cancer or adenoma, lane 5 is a positive control, lanes 
6-10 are from patients who did not have cancer or adenoma, lanes 11-12 are negative 
controls, and lanes 13-18 are standards at the approximate molecular weight indicated 
in the figure. Amplifications were graded A through C, A being the most intense band, 

25 C being the least. 

Figure 6 is a gel photograph showing results of amplification of p53 (exon 7) DNA 
isolated from stool using forward and reverse primers spaced about 200 bp apart. The 
band intensity relates to the amount of 200 bp product or greater in the sample. Lanes 
1-4 are results from patients with cancer or adenoma, lane 5 is a positive control, lanes 

30 6-1 0 are from patients who did not have cancer or adenoma, lanes 11-12 are negative 
controls, and lanes 13-18 are standards at the approximate molecular weight indicated 
in the figure. Amplifications were graded A through C, A being the most intense band, 
C being the least. 
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Figure 7 is a gel photograph showing results of amplification of p53 (exon 8) DNA 
isolated from stool using forward and reverse primers spaced about 200 bp apart. The 
band intensity relates to the amount of 200 bp product or greater in the sample. Lanes 
1-4 are results from patients with cancer or adenoma, lane 5 is a positive control, lanes 
5 6-10 are from patients who did not have cancer or adenoma, lanes 11-12 are negative 
controls, and lanes 13-18 are standards at the approximate molecular weight indicated 
in the figure. Amplifications were graded A through C, A being the most intense band, 
C being the least. 

Figure 9-10 are gel photographs of results of amplification of DNA from stool 

10 samples using forward and reverse primers spaced approximately 1 .8 Kb apart. The 
band intensity shows the amount of 1.8 Kb or greater product. Lanes 1, 8, and 9 are 
negative controls, lanes 2, 3, and 5 are results from patients with cancer or adenoma, 
lanes 4, 6, and 7 are results from patients who did not have cancer or adenoma, and 
lanes 10-14 are molecular weight standards. 

15 Figures 1 1 A and B are gel photographs of results of amplification of DNA in 

stool from a total of 30 patients and controls. The band intensity relates to the amount 
of amplifiable DNA in the sample. Lanes N are negative controls, lanes 1.3. 11, and 18 
are results from patients which are indicative of the presence off cancer or adenoma, 
lanes 2, 4, 5-10, 12-17, and 19-30 are results from patients which are indicative of the 

20 absence of cancer or adenoma. The remaining lanes are markers or standards. 

Figure 12 shows a schematic representation of the placement of the primers for 
amplification in a method of the present invention. In this method, a single toward 
primer, Fi, is used in conjunction with a series of reverse primers, Ri to Re, chosen to 
amplify progressively longer portions of the target. 

25 Figure 1 3 shows a schematic representation of the placement of the primers for 

amplification in a method of the present invention. In this method, a series of forward 
and reverse primer pairs, (F^ , Ri) to (F 3 , R3). are chosen to amplify portions of the 
target spaced at intervals along the target. 
Detailed Description of the Invention 

30 The invention provides methods for the analysis of biological samples. Methods 

of the invention provide diagnostically-relevant information based upon the integrity of 
nucleic acids in a biological sample. Normal biological samples (those not having 
indicia of the disease being screened), especially those comprising luminal tissue 
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and/or fluid, typically comprise a majority of short-fragment, low-integrity nucleic acids 
(especially DNA) which are the result of degradation by apoptosis. When a mutation 
has caused genomic instability, the normal cell cycle may be disrupted and apoptotic 
degradation may not occur at the rate expected in a normal sample. Methods of the 

5 invention screen for such disruptions. 

Accordingly, preferred methods of the invention comprise determining an amount 
of amplifiable nucleic acid in a biological sample, and determining whether that amount 
is consistent with an amount expected in a normal sample. In many biological samples, 
especially heterogeneous samples, there may be no detectable amplification product. 

10 That is especially true when longer fragments are used as templates for amplification. 
Generally, the probability that any given set of PCR primers will amplify a DNA fragment 
having a length exceeding the primer distance is expressed as 

% of Fragments Amplified = (FL-PD)/(FL+PD) 

wherein FL is fragment length (in base pairs) and PD is primer distance (in base 

15 pairs). This equation assumes that sample DNA fragment lengths are uniformly 
distributed (i.e., there is no favored locus at which breaks occur). 

in a preferred embodiment, methods of the invention comprise amplifying 
sequences of different length in a sample, if present, in order to generate a profile of 
amplification products indicative of disease or the propensity for disease. In a preferred 

20 method, a sample is exposed to a set of PCR primers comprising a single forward 

primer, which may be a capture probe used to capture target fragments, and a plurality 
of downstream reverse primers which hybridize to portions of a contiguous sequence (if 
present) in the sample. Amplifications using these primers will result in a series of 
amplification products, each having a different length, if the contiguous target sequence 

25 is present in the sample. The length of the amplification products are determined by the 
spacings between the forward primer and each of the downstream reverse primers. An 
example is shown in Figure 12, which is a schematic representation showing placement 
of the primers for amplification. 

If the target sequence, or a portion of it, is present in the sample, amplification 

30 will result in a series of fragments the length of which is dictated by the spacing of the 
primers. According to the principles adduced above, a sample from a diseased patient 
will produce a profile of amplification products in the assay described above that differs 
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from the profile obtained from a sample containing the smaller fragments expected to be 
produced as a result of normal apoptosis. In a preferred embodiment, the forward 
primer is designed to hybridize about 200 bp upstream of the first reverse primer, and 
about 2.3 Kb upstream of the last reverse primer. Other reverse primers are designed 
5 to hybridize at various locations between the first and last reverse primers. Preferred 
intervals between the forward primer and the various reverse primers are 200 bp (F r 
R0, 400 bp (FrR 2 ), 800 bp (FrFfe), 1.3 Kb, (F r R 4 ), 1.8 Kb (F r R 5 ), and 2.3 Kb (Fi-R 6 ). 
The number and spacing of reverse primers is chosen at the convenience of the skilled 
artisan. 

io Also in a preferred embodiment, a hybrid capture probe is used to anchor a 

target sequence, preferably on a solid support (e.g., beads). A plurality of probes are 
then placed at various distances downstream of the capture probe. Those probes can 
be pairs of forward and reverse primers as discussed above, or they can be signal 
amplification probes, such as those used in Ligase Chain Reaction (LCR), and others 

15 used in the identification of sequences. The plurality of probes hybridize along the 
length of a target fragment if the target is present in the sample. Thus, by interrogating 
samples for the presence of the probes, one can determine the integrity of sequences 
present in the sample. This can be done in numerous ways, including, but not limited 
to, hybrid capture, PCR, LCR, strand displacement, branched chain, or other assays 

20 known in the are that incorporate hybrid probes or primers in order to identify or 
quantitate sequence. A sample containing intact (high integrity) nucleic acids 
represents a positive screen according to the invention. In one embodiment, sample is 
placed into wells (e.g., on a 96 well plate) containing support-bound capture probe. The 
capture probe immobilizes a target sequence, if present in the sample. Probes that 

25 hybridize to sequence downstream of the capture probe (downstream probes) are 
placed into each well, such that each downstream probe is spaced a unique distance 
apart from the common capture probe, and each well contains only one type of 
downstream probe. Signal is then generated by, for example, amplification, or by 
standard ELISA procedure followed by amplification, or by LCR, or other methods 

30 mentioned above. The presence of signal in each well indicates the presence of 

se q Uence 0 f a t least the length between the capture probe and the downstream probe. 
In an alternative embodiment, each well receives multiple different downstream probes, 
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which may be distinctly labeled, and the presence of label(s) is correlated with the 
length of sequence presence in the sample. 

A sample from a patient having, for example, cancer will produce amplicon 
between most or all of the primer pairs (depending, inter alia, on the length of the target 

5 fragments, on the spacing of the primers, and where on the target the primers are 
spaced). Such a profile represents a positive screen for disease or the propensity for 
disease. A sample from a patient who does not have indicia of disease results in little or 
no amplification product in the assay described above. In a negative screen there may 
be amplification of small (e.g., 200 bp) fragments but there should be no amplification of 

10 larger fragments (i.e., fragments resulting from amplification between the forward primer 
and spaced-apart reverse primers). In cancer diagnostics, the target fragment may 
optionally be an oncogene, a tumor suppressor, or any other marker associated with 
cancer. However, it is not necessary to use cancer-associated markers in methods of 
the invention, as such methods are based on the general recognition that samples 

15 indicative of disease contain a greater amount of intact nucleic acids and a greater 
amount of long fragment nucleic acids. Accordingly, any convenient target nucleic acid 
locus may be used in the methods of the invention. 

The amplification reactions described above may be conducted according to any 
suitable or convenient protocol and the fragment size of the resulting amplification 

20 products (if any) may be determined by any suitable or convenient means. 

In an alternative embodiment, methods of the invention comprise conducting a 
series of amplification reactions on a contiguous nucleic acid target fragment, each 
application reaction comprising one forward primer and one reverse primer, such that 
pairs of forward and reverse primers are spaced at intervals on a contiguous fragment 

25 suspected to be in the sample. An example of this arrangement is shown in Figure 13. 
Preferably, the spacings between each forward and reverse primer pair are equivalent. 
In a positive screen, the assay described above will result in a series of same-size 
fragments for most if not all of the primer pairs. Such an array of amplification products 
evidences a contiguous target sequence indicative of disease (see above). A sample 

30 from a disease-free patient should produce little or no amplification product, but in any 
case will not produce the contiguous array of amplification products expected from a 
sample containing a relatively intact diagnostic target sequence. 
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Each of the methods described above are based upon the principle that an intact 
nucleic acid, or a segment of an intact nucleic acid, in a sample is diagnostic. Thus, 
variations on the methods described above are contemplated. Such variations include 
the placement of primers, the number of primers used, the target sequence, the method 
5 for identifying sequences, and others. For example, in the method depicted in Figure 
13, and described above, it is not necessary that the numbers of forward and reverse 
primers be equal. A forward primer may, for example, be used to amplify fragments 
between two reverse primers. Other variations in primer pair placement are within the 
skill in the art, as are details of the amplification reactions to be conducted. Finally, as 
io represented in Figures 12 and 13, capture probes may be used in methods of the 
invention in order to isolate a chosen target sequence. 

The following examples provide further details of methods according to the 
invention. For purposes of exemplification, the following examples provide details of the 
use of the method if the present invention in colon cancer detection. Accordingly, while 
15 exemplified in the following manner, the invention is not so limited and the skilled artisan 
will appreciate its wide range of application upon consideration thereof. 

Exemplary Method for the Detection of Colon Cancer 

The following example relates to screening for colon cancer in voided stool 
samples. Based upon the principles upon which the invention is based (see above), the 
20 same analysis can be performed on other samples, such as those mentioned above, 
with the same results as shown herein. 

For the analysis of stool samples, preferred methods of the invention comprise 
obtaining at least a cross-sectional or circumferential portion of a voided stool as taught 
in U.S. patent number 5,741,650, and co-pending, co-owned U.S. patent application 
25 serial number 09/059,718, both of which are incorporated by reference herein. While a 
cross-sectional or circumferential portion of stool is desirable, methods provided herein 
are conducted on random samples obtained from voided stool, which include smears or 
scrapings. Once obtained, the stool specimen is homogenized. A preferable buffer for 
homogenization is one that contains at least 16 mM ethylenediaminetetraacetic acid 
30 (EDTA). However, as taught in co-pending, co-owned U.S. patent application serial 
number 60/122,177, incorporated by reference herein, it has been discovered that the 
use of at least 150 mM EDTA greatly improves the yield of nucleic acid from stool. 
Thus, a preferred buffer for stool homogenization comprises phosphate buffered saline, 
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20-100 mM NaCI or KCI, at least 150 mM EDTA, and optionally a detergent (such as 
SDS) and a proteinase (e.g., proteinase K). 

After homogenization, nucleic acid is preferably isolated from the stool sample. 
Isolation or extraction of nucleic acid is not required in all methods of the invention, as 

5 certain detection techniques can be adequately performed in homogenized stool without 
isolation of nucleic acids. In a preferred embodiment, however, homogenized stool is 
spun to create a supernatant containing nucleic acids, proteins, lipids, and other cellular 
debris. The supernatant is treated with a detergent and proteinase to degrade protein, 
and the nucleic acid is phenol-chloroform extracted. The extracted nucleic acids are 

io then precipitated with alcohol. Other techniques can be used to isolate nucleic acid 
from the sample. Such techniques include hybrid capture, and amplification directly 
from the homogenized stool. Nucleic acids can be purified and/or isolated to the extent 
required by the screening assay to be employed. Total DNA is isolated using 
techniques known in the art. 

15 Screening Assay Protocol 

The size of human DNA fragments obtained above can be determined by 
numerous means. For example, human DNA can be separated using gel 
electrophoresis. A 3% agarose gel is prepared using techniques known in the art. See 
Ausubel et. al., Short Protocols in Molecular Biology, John Wiley & Sones, 1195, pgs. 2- 

20 23-2-24, incorporated by reference herein. The size of human DNA fragments is then 
determined by comparison to known standards. Fragments greater than about 200 bp 
provide a positive screen. While a diagnosis can be made on the basis of the screen 
alone, patients presenting a positive screen are preferably advised to seek follow-up 
testing to render a confirmed diagnosis. 

25 A preferred means for determining human DNA fragment length uses PCR. 

Methods for implementing PCR are well-known. In the present invention, human DNA 
fragments are amplified using human-specific primers. Amplicon of greater than about 
200 bp produced by PCR represents a positive screen. Other amplification reactions 
and modifications of PCR, such as ligase chain reaction, reverse-phase PCR, Q-PCR, 

30 and others may be used to produce detectable levels of amplicon. Amplicon may be 
detected by coupling to a reporter (e.g. fluorescence, radioisotopes, and the like), by 
sequencing, by gel electrophoresis, by mass spectrometry, or by any other means 
known in the art, as long as the length, weight, or other characteristic of the amplicons 
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identifies them by size. 
Examples 

Experiments were conducted to determine whether characteristics of amplifiable 
DNA in stool were predictive of cancer or precancer in patients from whom stools 

5 samples were obtained. In the first experiment, the amount of amplifiable DNA was 
measured in each of several stool samples using PCR amplification to detect DNA 
fragments in the sample of at least 200 base pairs in length. The second experiment 
determined the amount of long fragments (greater than 200 base pair) in the same 
samples, and then determined ratios of long product to short product. The third 

10 experiment determined a profile of amplification products with nucleic acid fragment 
lengths of 200 bp, 400 bp, 800 bp, 1 .3 Kb, 1 .8 Kb and 2.4 Kb. The fourth and fifth 
experiments were clinical studies correlating the integrity of nucleic acids in patient stool 
samples with, overall patient disease status. 
EXAMPLE 1 

15 Stool samples were collected from 9 patients who presented with symptoms or a 

medical history that indicated that a colonoscopy should be performed. Each stool 
sample was frozen. Immediately after providing a stool sample, each patient was given 
a colonoscopy in order to determine the patient's disease status. Based upon the 
colonoscopy results, and subsequent histological analysis of biopsy samples taken 

20 during colonoscopy, individuals were placed into one of two groups: normal or 

abnormal. The abnormal group consisted of patients with cancer or with an adenoma of 
at least 1 cm in diameter. Based upon these results, 4 of the 9 patients were placed 
into the abnormal group. 

The samples were screened by hybrid capturing human DNA, and determining 

25 the amount of amplifiable DNA having at least 200 base pairs. Each frozen stool 
specimen, weighing from 7-33 grams, was thawed and homogenized in 500 mM Tris, 
16 mM EDTA, and 10 mM NaCI, pH 9.0 at a volume, to mass ratio of 3:1. Samples 
were then rehomogenized in the same buffer to a final volume-to-mass ratio of 20:1, 
and spun in glass macro beads at 2356 xg. The supernatant was collected and treated 

30 with SDS and proteinase k. The DNA was then phenol-chloroform extracted and 
precipitated with alcohol. The precipitate was suspended in 10 mM Tris and 1 mM 
EDTA (1 x TE), pH 7.4. Finally, the DNA was treated with Rnase. 
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Human DNA was isolated from the precipitate by sequence-specific hybrid 
capture. Biotynilated probes against portions of the p53, K-ras, and ape genes were 
used. 

The K-ras probe was 5'GTGGAGTATTTGATAGTGTATTAACCTTATGTGTGAC 
5 3' (SEQ ID NO: 1). 

There were two ape probes: apc-1309 was 
5'TTCCAGCAGTGTCACAGCACCCTAGAACCAAATCCAG 3* (SEQ ID NO: 2), and 
apc-1378 was 5'CAGATAGCCCTGGACAAACAATGCCACGAAGCAGAAG 3' (SEQ ID 
NO: 3). 

10 There were four probes against p53, the first (hybridizing to a portion of exon 5) 

was 5TACTCCCCTGCCCTCAACAAGATGTTTTGCCAACTGG3' (SEQ ID NO:4), the 
second (hybridizing to a portion of exon 7) was 

S'ATTTCTTCCATACTACTACCCATCGACCTCTCATCS' (SEQ ID NO: 5), the third, also 
hybridizing to a portion of exon 7 was 
15 5'ATGAGGCCAGTGCGCCTTGGGGAGACCTGTGGCAAGC3' (SEQ ID NO: 6); and 
finally, a probe against exon 8 had the sequence 

5'GAAAGGACAAGGGTGGTTGGGAGTAGATGGAGCCTGG3' (SEQ ID NO: 7). 

A 10 nl aliquot of each probe (20 pmol/capture) was added to a suspension 
containing 300 nl DNA in the presence of 310 \x\ 6M GITC buffer for 2 hours at room 
20 temperature. Hybrid complexes were isolated using streptavid in-coated beads (Dynal). 
After washing, probe-bead complexes were suspended at 25° C for 1 hour in 0.1x TE 
buffer, pH7.4. The suspension was then heated for 4 minutes at 85° C, and the beads 
were removed. 

Captured DNA was then amplified using PCR, essentially as described in U.S. 

25 Patent No. 4,683,202, incorporated by reference herein. Each sample was amplified 
using forward and reverse primers through 7 loci (Kras, exon 1, APC exon 15 (3 
separate loci), p53, exon 5, p53, exon 7, and p53, exon 8) in duplicate (for a total of 14 
amplifications for each locus). Seven separate PCRs (40 cycles each) were run in 
duplicate using primers directed to detect fragments in the sample having 200 base 

30 pairs or more. Amplified DNA was placed on a 4% Nusieve (FMC Biochemical) gel (3% 
Nusieve, 1% agarose), and stained with ethidium bromide (0.5 ng/ml). The resulting 
amplified DNA was graded based upon the relative intensity of the stained gels. The 
results are shown in Figures 1-7. Each Figure represents the results for all 9 patients 
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(including standards) for the seven different loci that were amplified. As shown in the 
Figures, each sample from a patient with cancer or adenoma was detected as a band 
having significantly greater intensity than the bands associated with samples from 
patients who did not have cancer or precancer. All four cancer/adenoma patients 
5 identified using colonoscopy were correctly identified by determining the amount of 
amplifiable DNA 200 base pairs or greater in length. As shown in Figures 1-7, the 
results were the same regardless of which locus was amplified. Accordingly, the 
amount of 200 bp or greater DNA in a sample was predictive of patient disease status. 
EXAMPLE 2 

io An experiment was conducted that was essentially identical to the one described 

above in Example 1 , but forward and reverse primers were placed such that fragments 
of about 1.8 Kb and above were amplified. 

DNA was prepared as described above. Forward and reverse primers were 
spaced so as to hybridize approximately 1.8 Kb apart on three different loci (Kras, exon 

15 1, APC, exon 15, and p53 exon 5). Thirty-three rounds of amplification were performed, 
and the resulting DNA was placed on a 3% agarose gel. The results are shown in 
Figures 8-10. As shown in the Figures (which show results from three separate 
experiments to amplify and detect "long" product), samples from individuals having 
cancer or precancer produced large amounts of high-molecular weight (in this case 1 .8 

20 Kb and above) DNA; whereas samples from patients who did not have cancer or 
precancer produced no DNA in the range of about 1.8 Kb and higher. Thus, the 
presence of high-molecular weight DNA was indicative of the disease status of the 
patient. 
EXAMPLE 3 

25 An experiment was conducted to determine the molecular weight profile of DNA 

from samples collected and prepared as part of a blind study on 30 patients who 
presented at the Mayo Clinic with suspected gastrointestinal disorders. Stool samples 
were obtained, and DNA was isolated as described above. 

Prior to amplification, DNA was isolated from the samples by hybrid capture. 

30 Biotynilated probes against portions of the BRCA1, BRCA2, p53, APC genes were 
used. 

The BRCA1 probe was 
S'GATTCTGAAGAACCAACTTTGTCCTTAACTAGCTCTTS 1 (SEQ ID NO: 8). 
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The BRCA2 probe was 
5 ' CTAAGTTTG AATC C ATG CTTTG CTCTTCTTG ATTATT3 ' (SEQ ID NO 9). 

The APC1 probe was 
5'CAGATAGCCCTGGACAAACCATGCCACCAAGCAGAAG3' (SEQ ID NO 10). 
5 The p53 probe, hybridizing to a portion of exon 5, was 

5TACTCCCCTGCCCTCAACAAGATGTTTTGCCAACTGG3 1 (SEQ ID NO:4). 

The APC2 probe was 
5 , GAAGTTCCTGGATTTTCTGTTGCTGGATGGTAGTTGC3 , (SEQ ID NO 11). 

A 300 aliquot of sample was placed in 300 fil of 6 M guanidine isothiocyanate 
io buffer with 10 nl of each capture probe, and incubated overnight at 25 C. Captured 
DNA was isolated using 100 \i\ capture beads incubated for one hour at room 
temperature. The DNA was eluted off the beads and PCR amplified under standard 
PCR conditions. 

According to methods of the invention, amplification reactions were conducted 
15 using forward and reverse primers through the 5 loci for each sample. Forward and 
reverse primers were spaced to amplify fragments of 200 bp, 400 bp, 800 bp, 1.3 Kb ( 
1.8 Kb, and 2.4 Kb. Each of 30 PCR reactions was run for 36 cycles. Amplicon was 
run on a 3% Seakeam gel, and stained with ethidium bromide. The results are shown in 
Figures 1 1 A and 1 1 B. Each figure represents the results for 1 5 of the 30 patients. 
20 As shown in those figures, patients with cancer or adenoma have an increased 

yield of amplifiable DNA. That is especially true at the 1.8 Kb level and above. Thus, 
patients with cancer or adenoma not only produce more amplifiable DNA in their stool, 
but also produce larger DNA fragments than are produced in the stool of patients who 
do not have cancer. Thus, both an increased yield of amplifiable DNA and the presence 
25 of high molecular weight DNA, especially that at 1 .8 Kb and above, were indicative of 
patient disease status. 
EXAMPLE 4 

In this example, methods of the invention were correlated with clinical outcome in 
numerous patients who had a colorectal adenoma or colorectal cancer as diagnosed 
30 using colonoscopy, and 79 patients who were diagnosed as not having colorectal 
cancer or adenoma. A stool sample was obtained from each of these patients and 
prepared as described above. Fragments of the 5 different loci referred to above were 
amplified using primers spaced 200, 400, 800, 1300, 1800, and 2400 base pairs apart 
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using the protocol described above in Example 3. Each amplification was scored such 
that successful amplification of a fragment received a score of 1, and no amplification 
received a score of 0. Since five loci were interrogated using 6 primer pairs each, the 
maximum score was 30 (successful amplification of all 6 fragments at all five loci). The 
5 cutoff for a positive screen was set at 21 . The results are shown below. 
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Table 1 
Normals 



patient No 


Age 


Score 








P-178 


64 


19 


P- 1 85 


50 


18 


P-033 


56 


16 


p. 177 


67 


14 


P-055 


75 


13 


P-029 


70 


12 


P-079 


63 


12 


P-066 


72 


11 


P-027 


65 


10 


P-054 


72 


9 


P-158 


59 


9 


P-043 


56 


8 


P-009 


73 


7 


P-030 


86 


2 


P-032 


51 


1 


P-068 


58 


1 


P-187 


63 


1 


P-0I8 


68 


0 


P-186 


61 


17 


P-135 


67 


14 


P-120 


75 


13 


P-179 


76 


9 


P-057 


56 


7 


P-143 


65 


6 


P-136 


58 


1 


P-012 


75 


0 
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Table 2 
Adenomas 



Patient No. 


A EC 


Score 


P-003 




29 


P-001 




23 


P-045 




22 


P-162 




21 


P-163 




16 


P-088 




15 


P-050 




13 


P-060 




11 


P-061 




11 


P1058 




10 


P-075 




10 


P-077 




8 


P-024 




7 


P-056 




7 


P-067 




7 


P-025 




6 


P-080 




4 


P-123 




4 


P-048 




3 


P-040 




2 


P-006 




1 


P-004 




0 


P-015 




0 


P-083 




0 


P-047 






P-129 







WO 01/42502 



PCT/USOO/24834 



Table 3 
Carcinomas 



Patient No. 


Ape 


Score 


P-064 




30 


P-103 




30 


P-104 




30 


P-108 




30 


P-101 




29 


P-102 




29 


P-099 




28 


P-I07 




28 


P-110 




26 


P-098 




25 


P-134 




24 


P-062 




23 


P-090 




23 


P-095 




23 


P-093 




22 


P-100 




21 


P-122 




18 


P-084 




15 


P-109 




15 


P-118 




10 


P-138 




10 


P-091 




8 


P-096 




8 


P-053 




7 


P-119 




6 


P-1I7 




5 


P-105 




0 


P-097 







As shown above, methods of the invention are effective in screening for the 
presence of colorectal cancer and adenoma. 
EXAMPLE 5 

In this example, methods of the invention were used to detect non-colonic 
cancers in 28 patients. 

A stool sample was obtained from each of the 28 patients. The sample was 
prepared as described above. Fragments of the 5 different loci referred to above were 
amplified using primers spaced 200, 400, 800, 1300, 1800, and 2400 base pairs apart 
using the protocol described above in Example 3. Each amplification was scored such 
that successful amplification of a fragment received a score of 1, and no amplification 
received a score of 0. Since five loci were interrogated using 6 primer pairs each, the 
maximum obtainable score was 30 (successful amplification of all 6 fragments at all five 
loci). A score of 21 was used as a cutoff between diseased and non-diseased patients. 
The results are shown below. 
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Table 4 
Supercolonic Cancers 



Patient No. 


Supercolonic 
Cancer 


Age 


Score 


P- 145 


Pancreas 


68 


30 


P-164 


Lung CA 


68 


in 


P- 166 


Bile Duct 


52 


if\ 
JU 


P-189 


Bile Duct 


43 


30 


P-190 


Lung CA 


50 


30 


P-019 


Atypical 
Findings in 
Stomach 


71 


29 


P-152 


Lung CA 


77 


Zo 


P-167 


Pancreas 


72 


•JO 


P-011 


Lung CA 


73 


27 


P-153 


Pancreas 


65 


27 


P-165 


Lung CA 


85 


27 


P-170 


Duodenum 


65 


27 


P-182 


Barrett's 
Esophagus 


58 


27 


P-146 


Bile Duct 


63 


26 


P-081 


Barrett's 
Esophagus 


1A 
1 4 


xo 


P-151 


Pancreas 


49 


25 


P-I55 


Lung CA 


60 


25 


P-156 


Lune CA 


57 


25 


P-150 


Pancreas 


78 


23 


P-149 


Esophagus 


59 


19 


P-154 


Esophagus 


80 


19 


P-169 


Pancreas 


71 


19 


P-168 


Lung CA 


63 


18 


P-180 


Pancreas 


67 


13 


P-144 


Esophagus 


59 


9 


P-147 


Stomach 


57 


7 


P-148 


Stomach 


69 


6 


P-171 


Esophagus 


76 


0 



As shown above, methods of the invention successfully screened 18 out of 27 
patients who actually had non-colonic cancer. Only one patient was misdiagnosed as 
having cancer when he did not. Thus, the methods of the invention are useful for non- 
invasive diagnosis of a non-specified cancerous disease state in a patient. 

The threshold of 21 for a positive screen can be changed to accommodate 
desired sensitivities and specificities. For example, if 18 were the false negative results 
shown in Table 4 would be avoided. The skilled artisan knows how to set thresholds 
depending on the patient (e.g., a lower threshold for patients with symptoms than 
patients presenting with no symptoms), the disease being diagnosed, and the desired 
level of sensitivity and specificity. Regardless of the threshold, the principle of the 
invention remains that nucleic acid integrity is a viable marker for disease, and 
especially for cancer. 
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!n addition, the propensity for disease may be measured using methods of the 
invention. For example, periodic molecular weight profiling in accordance with the 
methods of the invention may be used to monitor the disease state of a patient 
presenting no or minimal symptoms. Such longitudinal monitoring will determine 
whether a patient is progressing with increasing amounts of high integrity nucleic acids 
- indicating the desirability for follow-up examination. 
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What is claimed is: 



1 1 . A method for screening for a supercolonic disease in a patient, the method 

2 comprising the steps of: 

3 determining the integrity of nucleic acids in a patient sample comprising shed 

4 cells or cellular debris; and 

5 identifying said patient as having disease if intact nucleic acids are present in 

6 said sample in an amount greater than a predetermined threshold. 

1 2. The method of claim 1 , wherein said disease is cancer or pre-cancer. 

i 3. The method of claim 2, wherein said cancer is selected from the group consisting 



2 of lung cancer, esophageal cancer, prostate cancer, stomach cancer, pancreatic 

3 cancer, liver cancer, cancer of the bile duct, cancer of the duodenum, and lymphoma. 
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